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ABSTRACT 

This research aims at using vehicular ad-hoc networks as 
infra-structure for an urban cyber-physical system in order 
to gather data about a city. In this scenario, all nodes are 
data sources and there is a gateway as ultimate destina- 
tion for all packets. Because of the volatility of the network 
connections and uncertainty of actual node placement, we 
argue that a broadcast-based protocol is the most adequate 
solution, despite the high overhead. 

The Urban Data Collector (UDC) protocol has been pro- 
posed which uses a distributed election of the forwarding 
node among the nodes receiving the packet: nodes that are 
nearer to the gateway have shorter timers and a higher for- 
warding probabilities. The performance of the UDC pro- 
tocol has been evaluated with different suppression levels 
in terms of the amount of collected data from each road 
segment using NS-3, and our results show that UDC can 
achieve significantly higher sensing accuracy than to other 
broadcast-based protocols. 

1. INTRODUCTION 

Vehicular ad-hoc networks (VANET) have moved into the 
spotlight driven mainly by the benefits expected from safety, 
traffic management and infotainment applications [7j. In 
this paper we propose a different utilization, namely to use 
a VANET as the infra-structure for an urban monitoring 
system, an approach that has not been explored so far. Ve- 
hicles equipped with a wide range of sensing devices and the 
ability to communicate with each other offer a unique oppor- 
tunity for gathering real-time data about a city, like traffic 
conditions [9], environmental parameters, video and audio 
for surveillance [8] , or physical condition of the drivers [l6] . 
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Knowing the updated state of relevant variables for a city 
is not only critical for applications such as navigation using 
real-time traffic information both for regular and for emer- 
gency vehicles, but also for personal mobility support and 
environmental monitoring [17] . 

The IEEE has recently released the 802. lip standard for 
VANET [2], which is derived from the 802.11a and 802. lie 
standards. The 802. lip PHY uses the 5.9 GHz frequency 
band with channels of 10 MHz. Additionally, 802. lip MAC 
does not require node association for communication, and no 
handshakes or acknowledgments are foreseen on the control 
channel. 

The purpose of urban sensing is to provide information about 
an urban area to entities outside the VANET in near real- 
time. Each node collects information about its environment 
and sends it periodically to the gateway, configuring a many- 
to-one network topology, where all nodes are simultaneously 
both data generators and forwarders and the ultimate des- 
tination of the data lies outside the VANET. 

Several protocols for data collection in wireless sensor net- 
works (WSN) and mobile adhoc networks (MANETs) have 
been previously proposed, but VANET links are more volatile 
and multi-hop paths have very short durations 19 , in ad- 



dition to the lack of reliability at the link level due to the 
highly dispersive environment. In this scenario, the proto- 
cols developed for WSN incur a very high path management 
overhead [14] and are not adequate. On the other hand, 
existing VANET protocols focus on unicast or data dissem- 
ination, and the scenario described has not been previously 
addressed. 

This article proposes and evaluates a data collection protocol 
over an urban VANET. Our main contributions are 1) a 
new data collection protocol for data gathering in urban 
area; 2) the evaluation of different suppression levels to limit 
the amount of forwarders at each hop; 3) the benchmarking 
of broadcast-based protocols using a metric appropriate to 
measure sensing performance. 

Our results show that significant gains can be obtained from 
probabilistic forwarding to further reduce the replication of 
packets at each hop while there are significant amount of 
packets from each road segment received at the gateway for 
a wide range of vehicle densities. 



2. DATA COLLECTION CHALLENGES 

The envisioned scenario is a city environment with moving 
cars which are collecting sensor data about the city, e.g. 
noise or air pollution, and the VANET is used as sensing 
infra-structure for gathering and sending the data to a back- 
office outside the VANET for monitoring purposes. This 
configures a many-to-one communication pattern, and it is 
assumed each node knows its own geographic position and 
that of the gateway. 

In a city, cars move along the streets with unpredictable 
patterns and buildings block the communication between 
vehicles in non-aligned streets, creating a very volatile en- 
vironment, where a single direct link lasts on average 20 s 
[19| . Data collection protocols for wireless sensor networks 
and MANETs foresee some movement and provide mecha- 
nisms for path re-establishment, but they would need to be 
used too often in such volatile scenarios causing too high an 
overhead. 

On the other hand, the amount of vehicles in each street seg- 
ment is highly variable in time and across different streets, 
and not feasible to be estimated in real-time by vehicles in 
other streets with low overhead. This makes it difficult to 
use source routing to take the packet towards the destina- 
tion, because it is not possible to know at the packet source, 
or anywhere along the way, whether there are vehicles that 
can serve as forwarders in the street segments between any 
vehicle and the gateway, causing packet die out with high 
likelihood. 

Furthermore, during traffic congestion, communication can 
be very difficult due to an overly loaded shared medium. 
In this situation, which is common in urban areas, requir- 
ing additional message exchange for coordination purposes 
increases further the medium load. 

A protocol for data collection in an urban scenario using 
vehicle-to- vehicle communication must take all these limita- 
tions into account. 



3. PROTOCOL DESIGN 

The proposed protocol is Urban Data Collector (UDC) pro- 
tocol, a data gathering protocol that has the following fea- 
tures: I) it does not require nodes to exchange periodic 
messages with their neighbors; 2) it uses the 802. lip MAC 
layer; 3) it takes advantage of redundant forwarding to in- 
crease packet delivery to the gateway; 4) it limits the amount 
of redundancy using suppression techniques. 

The UDC protocol is a network layer protocol that can be 
run on top of the recently standardized 802. Hp MAC. It is 
a broadcast- and receiver-based protocol, so that it can take 
advantage of any node that receives a forwarded packet with- 
out requiring exchange of information about node present 
in the neighborhood. This is especially relevant because the 
MAC layer does not provide acknowledgments, and it is not 
easy for a forwarder node to know whether its packet has 
been received by another node. Next we exemplify why some 
well-known solutions are not adequate for data gathering in 
urban areas, and explain how our protocol addresses those 
issues. 




Figure 1: UDC help packets to get to the gateway 
when a path in the direction of gateway is blocked 
by a building. 

Figure [T] illustrates a case where a greedy protocol cannot 
deliver a packet to the gateway (shown as an antenna on 
the right bottom) because a chosen forwarder does not have 
any neighbor in the direction of the gateway. Protocols like 
GPSR If use perimeter routing technique to recover from 
the blocked path and find another path to the gateway, but 
these introduce a high delay and cause a great deal of addi- 
tional transmissions. Because the proposed protocol follows 
two paths simultaneously (black and gray paths in Figure]!]), 
although the black path is blocked, the gray path provides 
a redundant path which avoids packet die out. 




Figure 2: UDC avoid packet die out when an acci- 
dent happens on the direction of the gateway. 

In Figure [5] the main path (black path) to the gateway is 
blocked because the packet reaches a node that has no other 
nodes in range. This can happen because of the high volatil- 
ity of the network, where the node that sent the packet is no 
longer in range after the routing dead-end has been detected. 
In this case, the perimeter technique will not be able to re- 
cover from this situation because there are no other nodes in 
the communication range of the single vehicle that has the 
packet. Because our protocol uses both the main path (black 
path) and the redundant path (gray path) for forwarding the 
packet the likelihood of both dying out is reduced. 

Existing protocols get stuck in a blocked road or a road with- 
out forwarders toward the gateway because most of them 
take a binary decision for selecting the next forwarder. UDC 
gives the forwarding opportunity to several neighbor nodes 
to create redundant paths to avert packet die out. Finally, a 
fully distributed suppression technique that requires no co- 
ordination among nodes has been used to limit the amount 
of redundant paths so as to reduce the overhead. This is ac- 
complished by taking advantage of two different types of for- 
warding: directional forwarding and probabilistic forward- 
ing. Directional forwarding means that each node, upon 
receiving a packet, forwards it in the direction of the gate- 
way after the expiration of a timer, which is used for giving 



forwarding priority to nodes that represent more progress 
towards the gateway. Probabilistic forwarding means that 
each node forwards the received packet with a probability 
which depends on difference distances between itself and the 
previous node to the gateway. The next sections describe in 
detail how these two techniques are combined to provide the 
desired functionality. 

3.1 Directional Forwarding 

UDC uses a network layer timer calculated as a function 
of the difference between the distance of the previous and 
potential forwarder to the gateway to prioritize forwarders 
nearer to the gateway. The timer is calculated at each node 
in a distributed manner based only on information contained 
in the header of the received packet and of the node itself. 




Figure 3: One-hop forwarding entities 



Figure [3] illustrates the scenario used to explain the calcula- 
tion of the timer: node i is the sender of the packet and nodes 
1, 2 and j are receivers and potential forwarders. The com- 
munication range is divided in two different areas, marked 
white and gray in Figure [3] Nodes in the gray area are 
nearer to the gateway than the previous hop, whereas nodes 
in the white area are farther away. Nodes that represent the 
most progress towards the destination should be preferred 
forwarders. These are the nodes located closer to the end of 
the transmission range in the direction of the gateway, hence 
they shall have shorter timers and forward earlier than nodes 
farther away from the gateway. 

The time coefficient is calculated according to Equation [l] 
This timer coefficient lies in range [0, f] and is then multi- 
plied by the maximum timer T max to deterministically cal- 
culate the actual timer value at all potential forwarders (Fig- 
ure ??). 



C= (1.0 + 



dj - — di — R 
2R 



(1) 



where dj is the distance of node j to the gateway, di the 
distance between the previous hop and the gateway, and R 
is the communication range. 

3.2 Suppression Techniques 

Suppression techniques are mechanisms that effect the re- 
duction of the number of packets re-forwarded at each hop. 
The simplest suppression mode is that any node that hears 
a packet being forwarded while waiting for timer expiration 
stops the timer and discards the packet. In this way, the 
node thats correctly receives a packet and is closest to the 
gateway will be the first to forward, and any node that hears 



that forwarding will not double forward. But nodes on dif- 
ferent sides of the previous hop may not hear each other, i.e., 
nodes in the white area do not hear nodes in the grey area 
in Figure [3] and cause duplicated forwarding. This problem 
could be solved by limiting the forwarder nodes to nodes 
that are nearer to the gateway than the previous hop. But 
this is likely to raise a problem for low node density, causing 
packet die out in the more likely event of lack of a forwarder 
nearer to the gateway. 

Four different levels of suppression has been studied to over- 
come these problems: basic, weak, moderate and strong sup- 
pression. In the basic method, nodes discard a packet if they 
receive the same packet during the timer or wait time (see 
Section [3. 3[ ), meaning that it has been forwarded by a node 
which is closer to the destination. 



Forwarding Probability 




Figure 4: Forwarding Probability calculated for dif- 
ferent suppression techniques 



In weak suppression, UDC reduces the number of forwarders 
by using probabilistic forwarding for the nodes with higher 
distance to the gateway (white area in Figure ??), while 
nodes closer to the destination (gray area in Figure ??) for- 
ward the packet with probability one. Nodes in the white 
area forward a packet with a probability that decreases with 
increasing distance to the gateway, calculated according to 
Eq.[2] 



Pfwd = (1-0 - 



dj — di 
R 



(2) 



In strong suppression, nodes in the gray area, which are 
ranked by timer, forward the packet with different proba- 
bility, calculated according to Eq. [3] and nodes in the white 
area do not forward the packet at all. 



Pfwd = 



, di — dj 
X R 



(3) 



An intermediate level, moderate suppression, is added for 
which the forwarding probability is increased linearly with 
decreasing distance of potential forwarders to the gateway, 



calculated using Eq. [4] 



Pfwd = ( 



di — dj + R. 
2R ' 



(4) 



Figure [4] shows the forwarding probability of different sup- 
pression levels where the x-axis shows the difference distance 
between the last hop and potential forwarders to the gate- 
way (di — dj) relative to the communication range (R). 

The complete network layer forwarding protocol algorithm is 
described in Figure [5] Upon receiving a packet, node checks 
if it is not a gateway and consequently if it did not receive the 
packet before, it calculates the timer (section 3.1 1 and in the 
end of the timer according to the probability (section |3.2[ ) 
forwards the packet. 



r 




- Calculate the timer 

- Schedule the packet forwarding 
on the timer 

- After the timer, forwards the 
packet with probability according 
to the suppression mechanism 



Figure 5: Protocol algorithm 



3.3 Channel Access Time 

The average CSMA/CA channel access time obtained from 
simulation of UDC protocol with different suppression tech- 
niques is illustrated in Figure [6] The average channel access 
time for all node densities for 4 different suppression tech- 
niques was 5 ms. This time is embedded in the protocol by 
letting each node wait for this duration before starting the 
timer at the network layer, to allow the suppression mecha- 
nisms to hear packets forwarded by other nodes. 

4. SIMULATION 

The Network Simulator 3 (NS-3) [l] version 3.9 is used to 
simulate a 2.5 x 2.5 km Manhattan Grid topology with 5x5 
roads, adding up to a total of 30km road length. There 
is one gateway located in the center of the topology, at 
(1250 m, 1250 m), and all other nodes are data sources. 
The communication range is set to 500 m using a path-loss 
exponent of 2.5, and each node has on average at least four 
nodes and at most 84 nodes (crossroads with 42 nodes/km) 
within its communication range. The data rate is set to 
1 kbps withlOO Bytes packets, which is sufficient for envi- 
sioned sensing applications [l7]. The other simulation pa- 
rameters can be found in Table Q] 
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Figure 6: Average channel access time for 4 sup- 
pression techniques. The slotted 1-persistence proto- 
col used for comparison is introduced in Section [6j 



Table 1: Simulation Set up 



Parameter Name 


Value 


Node Density 
Average Speed 
Minimum Speed 


4, 8, 16 and 42 nodes/km 
14 m/s 
3 m/s 


MAC protocol 
Channel Data rate 
Channel Bandwidth 
Communication Range 
Propagation Model 


802. lip 
6 Mbps 
10 MHz 
500 m 
Nakagami-m (m=1.56) 



Two seconds warm-up time is considered at the start of the 
simulation before reaching steady-state. The simulated time 
is set to guarantee that each node generates at least 20 pack- 
ets. UDC with four different suppression levels (Section |3.2[ | 
and the maximum timer equal to T max = 5ms is used for 
simulation. The slotted-1 persistence protocol [20j is em- 
ployed for comparison because it showed the best perfor- 
mance in a benchmark of existing protocols for data gath- 
ering using VANET in [l5] (see Section [BJl. 

5. PERFORMANCE EVALUATION 

The performance of UDC is evaluated using the following 
metrics: 



• Sensing accuracy: In the city environment for hav- 
ing an explicit view about the city, the urban sensing 
application needs having information about each road 
segment. Then the application can extract useful in- 
formation about each road segment and expand it from 
road segment to entire city. So the sensing accuracy is 
defined to show how obtained information is accurate 
and can be applicable for the urban sensing. There- 
fore, it has been defined as number of received packets 
from each road segment in a second from all nodes 
within that section. To calculate it, after analyzing 
the collected data at the gateway, the received packets 



are separated depending on the source's location. 

Network efficiency: The question of how well the 
protocol acts in the broadcast fashion is defined as 
network efficiency. This metric shows how many of the 
packet forwardings are used for the packets which get 
to the sink. The defination is: the number of hops for 
each packet receives at the sink, including both unique 
packets and their replicas, divided by the number of all 
forwarded packets in all nodes. 



Additionally, extensive simulations were run to determine 
the different reasons for the packet discards at each node, 
and the evaluation of these results is shown on this section. 

5.1 Sensing accuracy 

Figure [7] shows the sensing accuracy for the UDC with dif- 
ferent suppression levels. It plots the average number of 
received packets from each road segment at the gateway in 
one second windows in the simulated road topology against 
varying node densities. It shows that the sensing accuracy is 
most dependent on the node density when using the strong 
suppression level. For the lowest node density, it discards 
more than 87% of the received packets at each hop and is 
not able to produce enough redundancy paths, causing high 
probability of packet die out. As the node density increases, 
the strong suppression level represses about 97% of received 
packets at each hop and thus reduces the number of colli- 
sions, achieving the highest sensing accuracy for the highest 
node density. 
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Figure 7: Sensing accuracy for different data collec- 
tion mechanism in urban area 



the highest accuracy for the lowest node density, makes them 
a better choice for the envisioned application. The moderate 
suppression level has a sensing accuracy that lies midway 
between the performance of weak and strong, showing better 
performance than strong suppression at low node density 
and better performance than weak suppression at high node 
density. 

5.2 Network Efficiency 

Figure [8] shows that the UDC with strong suppression level 
uses broadcast forwarding in the most efficient way, since a 
larger percentage of the packets forwarded in the network 
end up reaching the sink. More interestingly, the weak sup- 
pression level shows a higher network efficiency than the 
basic level while achieving similar sensing accuracy (see pre- 
vious section) since the lower amount of forwarded packets 
causes a lower number of collisions. The moderate suppres- 
sion level again lies midway between strong and weak sup- 
pression levels, as intended at its construction. These results 
also show that the efficiency of the protocol drops with in- 
creasing node density due to the high amount of collision 
caused by a higher medium load, as expected for a broad- 
cast protocol in a shared medium. 
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Figure 8: Efficiency for using different suppression 
techniques 



5.3 Packet Drop Analysis 

An extensive simulation is carried out to analyze the rea- 
sons of performance drop for increasing node density, with 
the aim of gaining insight into the possible ways to improve 
the performance of the protocol. Figure [9] identifies the dif- 
ferent possible reasons for packet discards at the different 
communication layers. 



On the other hand, the weak and basic suppression level 
achieve similar sensing accuracy with less dependency to the 
node density. Since the sensing application needs to obtain 
information from each road segment independently of the 
node density, a protocol with less variations in the sensing 
accuracy for different node densities is a better choice for a 
sensing application. This fact and the fact that both provide 



In this analysis, only the packets that do not get to the 
sink is considered and the drop reasons for these packets is 
counted. Table [2] showsthe reason why the last replication 
of a packet still alive in the network dies out before reaching 
the gateway. This table also shows that the main reason for 
dropping the packets is excessive suppression, i.e. suppres- 
sion of packets that would have been useful for reaching a 
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Figure 9: Different reasons for drop packets depend 
on different communication layers 



higher sensing accuracy. 

Also, even for low node densities, suppression and collision 
are the main causes of packet die out, and not low SNR, 
i. e. packets do not die out because of lack of forwarders, 
but due to the broadcast nature of the protocol, which causes 
collisions and requires suppression. On the other hand, the 
single main cause for packet die out for the strong method 
is suppression, for any node density. 

Moreover, we conclude that an intermediate solution be- 
tween the weak and strong suppression techniques would 
achieve a better balance between the collision and suppres- 
sion trade-offs. So UDC with moderate suppression level is 
proposed as the best compromise, in accordance with the 
results of the performance evaluation in Section [5] 



Position-based Multi-hop 
Emergency Message Dis- 
or 



Stateless Routing (GPSR) [TT 
Broadcast Protocol(PMBP) [4 

semination for Vehicular environments (EMDV) |18_ 
Cross Layer Broadcast Protocol(CLBP) |3j. Sender-oriented 
protocols are inefficient for data gathering in urban areas due 
to the high overhead of continuously exchanging beacons 
and because additional mechanisms must be added to verify 
at each hop whether the chosen forwarder actually received 
and forwarded the packet. Furthermore, explicitly choos- 
ing a single forwarder in urban scenarios is not adequate for 
urban data collection, as we discussed in Section [2] 

Receiver-oriented protocols forward data packets without 
prior message exchange and employ mechanisms to reduce 
the network overhead. Protocols like Contention-based for- 
warding (CBF) 6 or Efficient Directional Broadcast (EDB) 
[13| use contention-based forwarding without handshaking. 
Upon receiving a packet, all nodes wait for a timer propor- 
tional to the geographic progress towards the gateway, but 
both require the exchange of further messages per packet 
and hop. CBF uses request-to-forward (RTF) and clear-to- 
forward (CTF) messages to reduce duplicate packets and 
EDB sends an ACK message before forwarding. 

Finally, [20] proposed three broadcast-based protocols that 
use basic per-hop forwarding and suppression techniques for 
data dissemination: Weighted p-Persistence, Slotted-1 Per- 
sistence and Slotted p-Persistence broadcasting. We eval- 
uated the performance of these protocols for data collec- 
tion over VANET [15] and the Slotted-1 Persistence proto- 
col shows better performance than the two others. Hence, 
we used it for comparison and introduce it here. 



Table 2: Percentage of each reasons in the protocol 
performance for different node densities 



Density 


Sup. 


Sup. 


Col. 


Low 


PDR 


nod. /km 


Method 


% 


% 


SNR% 


% 


4 


Strong 


81.06 





0.46 


18.48 




Weak 


19.23 


6.15 


1.4 


73.22 


8 


Strong 


48.01 


0.02 


0.04 


51.93 




Weak 


15.73 


6.92 


0.21 


77.14 


16 


Strong 


39.11 


0.22 


0.02 


60.65 




Weak 


28.64 


11.93 





59.43 


42 


Strong 


55.27 


0.79 





43.94 




Weak 


53.79 


33.59 





12.62 



6. RELATED WORK 

Existing solutions for VANET sensing either apply on-demand 
querying for local dissemination [l2] , sometimes keeping the 
data in the location [5], or rely on delay-tolerant network- 
ing and open Wi-Fi access points for sending the data to 
the Internet backbone 10 . The first are inefficient for real- 



timemonitoring due to the query overhead and the need to 
globally access the data, and the latter cannot guarantee 
up-to-date data. 

On the network level, a wide range of broadcast-based vehicle- 
to-vehicle unicast routing protocols has been proposed and 
we categorize them in two classes: sender-oriented and receiver- 
oriented. Sender-oriented protocols locally exchange bea- 
cons to obtain information. It enables the current forwarder 
to select the next forwarder node among its neighbors. This 



Slotted-1 persistence uses probabilistic and slotted forward- 
ing. Each node j, upon receiving a packet from node i, 
re-broadcasts the packet at a timeslot Tg t . if it receives the 
packets for the first time and does not receive any dupli- 
cates before the assigned timeslot, otherwise it discards the 
packets. Ts M is calculated by Ts tj = Sij x r, where r is the 
estimated one hop delay and $y is the assigned slot number, 
calculated by: 




R 



(5) 



Dij > R 



where N s is the predetermined number of slots, Dij is the 
distance between nodes i and j and R is average communi- 
cation range. 

7. CONCLUSION AND FUTURE WORK 

We envision the usage of a VANET as infra-structure for 
an urban cyber-physical system that makes up-to-date data 
about various parameters of an urban area available to ser- 
vices outside of the network. We propose and evaluate the 
use of UDC protocol, a broadcast- and receiver-based for- 
warding protocol. 

The effect of four different suppression techniques on the 
sensing accuracy and network overhead is evaluated using 
NS-3 large scale simulation. The results reveal that for sup- 
porting the sensing applications in the urban area, the weak 
suppression increases the sensing accuracy with lower de- 



pendency to the node density. Also it shows that the weak 
method has less excessive suppression than other methods 
and is the best solution of those evaluated for urban sensing. 

As part of ongoing efforts, the study of network limitation 
and its effect on the sensing accuracy will be investigated. 
Subsequently, the effect of the number of gateway on the 
sensing accuracy will be explored. 
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